Partitioning and Vectorizing Binary Applications for a Reconfigurable Vector Computer

Authors

  • Tobias Kenter
  • Gavin Vaz
  • Christian Plessl
Abstract

In order to leverage the use of reconfigurable architectures in general-purpose computing, quick and automated methods to find suitable accelerator designs are required. We tackle this challenge in both regards. In order to avoid long synthesis times, we target a vector coprocessor, implemented on the FPGAs of a Convey HC-1. Previous studies showed that existing tools were not able to accelerate a real-world application with low effort. We present a toolflow to automatically identify suitable loops for vectorization, generate a corresponding hardware/software bipartition, and generate coprocessor code. Where applicable, we leverage outer-loop vectorization. We evaluate our tools with a set of characteristic loops, systematically analyzing different dependency and data layout properties.
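
The abstract mentions outer-loop vectorization without showing what it looks like. The following is a minimal sketch of the general idea, not the authors' toolflow or Convey HC-1 coprocessor code. It assumes a loop nest whose inner loop carries a dependency (a running sum) while the outer iterations are independent. The function names, the vector length `vl`, and the sample data are all hypothetical, and each list comprehension merely stands in for one vector instruction.

```python
# Illustrative only: names, vector length, and data are assumptions,
# not taken from the paper.

def scalar_nest(a):
    """Reference loop nest: the inner loop depends on its previous iteration."""
    rows, cols = len(a), len(a[0])
    out = [[0] * cols for _ in range(rows)]
    for i in range(rows):          # outer loop: iterations are independent
        acc = 0
        for j in range(cols):      # inner loop: acc depends on the previous j
            acc = acc + a[i][j]
            out[i][j] = acc
    return out


def outer_vectorized(a, vl=4):
    """Same computation with the outer loop strip-mined into groups of vl rows.

    The inner loop stays sequential; each step processes one column element
    for up to vl rows (lanes) at once.
    """
    rows, cols = len(a), len(a[0])
    out = [[0] * cols for _ in range(rows)]
    for i0 in range(0, rows, vl):
        lanes = range(i0, min(i0 + vl, rows))
        acc = [0 for _ in lanes]                     # one accumulator per lane
        for j in range(cols):
            col = [a[i][j] for i in lanes]           # stands in for a vector load
            acc = [x + y for x, y in zip(acc, col)]  # stands in for a vector add
            for lane, i in enumerate(lanes):         # stands in for a vector store
                out[i][j] = acc[lane]
    return out


data = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12], [13, 14, 15]]
assert outer_vectorized(data) == scalar_nest(data)
```

In this sketch the vector load gathers one element from each of several rows, so with a row-major layout the accesses are strided rather than contiguous. This is one way in which data layout can interact with the choice of which loop to vectorize.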

Similar resources

Exploiting the AltiVec Unit for Commercial Applications

The introduction of the PowerPC 970 JS20 blade server opens opportunities for vectorizing commercial applications using the integrated AltiVec unit. We examined the vectorization of applications from diverse fields such as XML parsing, UTF-8 encoding, life sciences, string manipulations, and sorting. We obtained performance speedups (over optimized scalar code) for string comparisons (2-3), XML...

DPML-Risk: An Efficient Algorithm for Image Registration

Registration and tracking of targets and objects in a sequence of images play an important role in various areas. One method of image registration is the feature-based algorithm, which is accomplished in two steps. The first step includes finding features of the sensed and reference images. In this step, a scale space is used to reduce the sensitivity of detected features to scale changes. Afterw...

A Vectorizing SUIF Compiler

With advances in VLSI technology, it is now possible to implement vector processors on a single chip. Vector architectures are ideal for providing the cost-effective, real-time performance demanded by graphic-intensive applications such as MPEG players and video conferencing software. However, these vector microprocessors lack compilers and instead rely on libraries of hand-written assembly func...

Compilation for Scalable, Paged Virtual Hardware

Reconfigurable computing devices such as field programmable gate arrays (FPGAs) have demonstrated 10x-100x gains in performance and functional density over microprocessors for a variety of applications [13], yet their commercial use is limited primarily to serving as single-task ASIC replacements, which largely ignores their programmability and severely limits their applicability. SCORE (Stream...

A Vectorized Hash-Join

A vector instruction set is a well-known method for exposing bandwidth to applications. Although extensively studied in the scientific programming community, less work exists on vectorizing other kinds of applications. This work examines vectorizing a traditional database operation, a Grace hash-join. We show how to vectorize both the hash and join phases of the algorithm, and present performance re...

Publication date: 2014